NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Sample Complexity of Linear Regression Models for Opinion Formation in Networks

https://doi.org/10.1609/aaai.v39i13.33531

Liu, Haolin; Rajaraman, Rajmohan; Sundaram, Ravi; Vullikanti, Anil Kumar; Wasim, Omer; Xu, Haifeng (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Consider public health officials aiming to spread awareness about a new vaccine in a community interconnected by a social network. How can they distribute information with minimal resources, so as to avoid polarization and ensure community-wide convergence of opinion? To tackle such challenges, we initiate the study of sample complexity of opinion formation in networks. Our framework is built on the recognized opinion formation game, where we regard each agent’s opinion as a data-derived model, unlike previous works that treat opinions as data-independent scalars. The opinion model for every agent is initially learned from its local samples and evolves game-theoretically as all agents communicate with neighbors and revise their models towards an equilibrium. Our focus is on the sample complexity needed to ensure that the opinions converge to an equilibrium such that every agent’s final model has low generalization error. Our paper has two main technical results. First, we present a novel polynomial time optimization framework to quantify the total sample complexity for arbitrary networks, when the underlying learning problem is (generalized) linear regression. Second, we leverage this optimization to study the network gain which measures the improvement of sample complexity when learning over a network compared to that in isolation. Towards this end, we derive network gain bounds for various network classes including cliques, star graphs, and random regular graphs. Additionally, our framework provides a method to study sample distribution within the network, suggesting that it is sufficient to allocate samples inversely to the degree. Empirical results on both synthetic and real-world networks strongly support our theoretical findings.
more » « less
Free, publicly-accessible full text available April 11, 2026
Online Balanced Allocation of Dynamic Components

https://doi.org/10.4230/lipics.itcs.2025.81

Rajaraman, Rajmohan; Wasim, Omer (January 2025, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Meka, Raghu (Ed.)
We introduce Online Balanced Allocation of Dynamic Components (OBADC), a problem motivated by the practical challenge of dynamic resource allocation for large-scale distributed applications. In OBADC, we need to allocate a dynamic set of at most k𝓁 vertices (representing processes) in 𝓁 > 0 clusters. We consider an over-provisioned setup in which each cluster can hold at most k(1+ε) vertices, for an arbitrary constant ε > 0. The communication requirements among the vertices are modeled by the notion of a dynamically changing component, which is a subset of vertices that need to be co-located in the same cluster. At each time t, a request r_t of one of the following types arrives: 1) insertion of a vertex v forming a singleton component v at unit cost. 2) merge of (u,v) requiring that the components containing u and v be merged and co-located thereafter. 3) deletion of an existing vertex v at zero cost. Before serving any request, an algorithm can migrate vertices from one cluster to another, at a unit migration cost per vertex. We seek an online algorithm to minimize the total migration cost incurred for an arbitrary request sequence σ = (r_t)_{t > 0}, while simultaneously minimizing the number of clusters utilized. We analyze competitiveness with respect to an optimal clairvoyant offline algorithm with identical (over-provisioned) capacity constraints. We give an O(log k)-competitive algorithm for OBADC, and a matching lower-bound. The number of clusters utilized by our algorithm is always within a (2+ε) factor of the minimum. Furthermore, in a resource augmented setting where the optimal offline algorithm is constrained to capacity k per cluster, our algorithm obtains O(log k) competitiveness and utilizes a number of clusters within (1+ε) factor of the minimum. We also consider OBADC in the context of machine-learned predictions, where for each newly inserted vertex v at time t: i) with probability η > 0, the set of vertices (that exist at time t) in the component of v is revealed and, ii) with probability 1-η, no information is revealed. For OBADC with predictions, we give a O(1)-consistent and O(min(log 1/(η), log k))-robust algorithm.
more » « less
Full Text Available
One-Way Communication Complexity of Minimum Vertex Cover in General Graphs

https://doi.org/10.4230/lipics.icalp.2025.66

Derakhshan, Mahsa; Ghasemi, Andisheh; Rajaraman, Rajmohan (January 2025, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Censor-Hillel, Keren; Grandoni, Fabrizio; Ouaknine, Joel; Puppis, Gabriele (Ed.)
We study the communication complexity of the Minimum Vertex Cover (MVC) problem on general graphs within the k-party one-way communication model. Edges of an arbitrary n-vertex graph are distributed among k parties. The objective is for the parties to collectively find a small vertex cover of the graph while adhering to a communication protocol where each party sequentially sends a message to the next until the last party outputs a valid vertex cover of the whole graph. We are particularly interested in the trade-off between the size of the messages sent and the approximation ratio of the output solution. It is straightforward to see that any constant approximation protocol for MVC requires communicating Ω(n) bits. Additionally, there exists a trivial 2-approximation protocol where the parties collectively find a maximal matching of the graph greedily and return the subset of vertices matched. This raises a natural question: What is the best approximation ratio achievable using optimal communication of O(n)? We design a protocol with an approximation ratio of (2-2^{-k+1}+ε) and O(n) communication for any desirably small constant ε > 0, which is strictly better than 2 for any constant number of parties. Moreover, we show that achieving an approximation ratio smaller than 3/2 for the two-party case requires n^{1 + Ω(1/lg lg n)} communication, thereby establishing the tightness of our protocol for two parties. A notable aspect of our protocol is that no edges are communicated between the parties. Instead, for any 1 ≤ i < k, the i-th party only communicates a constant number of vertex covers for all edges assigned to the first i parties. An interesting consequence is that the communication cost of our protocol is O(n) bits, as opposed to the typical Ω(nlog n) bits required for many graph problems, such as maximum matching, where protocols commonly involve communicating edges.
more » « less
Full Text Available
Competitive Capacitated Online Recoloring

Rajaraman, Rajmohan; Wasim, Omer (September 2024, ESA 2024; Schloss Dagstuhl – Leibniz-Zentrum für Informatik)

Full Text Available
Competitive Data-Structure Dynamization

Mathieu, Claire; Rajaraman, Rajmohan; Young, Neal; Yousefi, Arman (October 2024, ACM transactions on algorithms)

Full Text Available
Scheduling Splittable Jobs on Configurable Machines

Casey, Matthew; Rajaraman, Rajmohan; Stalfa, David; Tan, Cheng (August 2024, APPROX 2024; Schloss Dagstuhl - Leibniz-Zentrum für Informatik)

Full Text Available
Online Paging with Heterogeneous Cache Slots

https://doi.org/10.1007/s00453-024-01270-z

Chrobak, Marek; Haney, Samuel; Liaee, Mehraneh; Panigrahi, Debmalya; Rajaraman, Rajmohan; Sundaram, Ravi; Young, Neal_E (October 2024, Algorithmica)

Abstract It is natural to generalize the online$$k$$ $k$ -Server problem by allowing each request to specify not only a pointp, but also a subsetSof servers that may serve it. To date, only a few special cases of this problem have been studied. The objective of the work presented in this paper has been to more systematically explore this generalization in the case of uniform and star metrics. For uniform metrics, the problem is equivalent to a generalization of Paging in which each request specifies not only a pagep, but also a subsetSof cache slots, and is satisfied by having a copy ofpin some slot inS. We call this problemSlot-Heterogenous Paging. In realistic settings only certain subsets of cache slots or servers would appear in requests. Therefore we parameterize the problem by specifying a family$${\mathcal {S}}\subseteq 2^{[k]}$$ $S \subseteq 2^{[k]}$ of requestable slot sets, and we establish bounds on the competitive ratio as a function of the cache sizekand family$${\mathcal {S}}$$ $S$ :If all request sets are allowed ($${\mathcal {S}}=2^{[k]}\setminus \{\emptyset \}$$ $S = 2^{[k]} \ {\emptyset}$ ), the optimal deterministic and randomized competitive ratios are exponentially worse than for standard Paging ($${\mathcal {S}}=\{[k]\}$$ $S = {[k]}$ ).As a function of$$|{\mathcal {S}}|$$ $| S |$ andk, the optimal deterministic ratio is polynomial: at most$$O(k^2|{\mathcal {S}}|)$$ $O (k^{2} | S |)$ and at least$$\Omega (\sqrt{|{\mathcal {S}}|})$$ $Ω (\sqrt{| S |})$ .For any laminar family$${\mathcal {S}}$$ $S$ of heighth, the optimal ratios areO(hk) (deterministic) and$$O(h^2\log k)$$ $O (h^{2} log k)$ (randomized).The special case of laminar$${\mathcal {S}}$$ $S$ that we callAll-or-One Pagingextends standard Paging by allowing each request to specify a specific slot to put the requested page in. The optimal deterministic ratio forweightedAll-or-One Paging is$$\Theta (k)$$ $Θ (k)$ . Offline All-or-One Paging is$$\mathbb{N}\mathbb{P}$$ $N P$ -hard.Some results for the laminar case are shown via a reduction to the generalization of Paging in which each request specifies a set$$P$$ $P$ ofpages, and is satisfied by fetching any page from$$P$$ $P$ into the cache. The optimal ratios for the latter problem (with laminar family of heighth) are at mosthk(deterministic) and$$hH_k$$ $h H_{k}$ (randomized).
more » « less
Greedy Peering in P2P Networks

Kiffer, Lucianna; Rajaraman, Rajmohan (May 2024, SIROCCO 2024; Springer)

Full Text Available
Fully Dynamic (Δ + 1)-Coloring Against Adaptive Adversaries

https://doi.org/10.1137/1.9781611978322.169

Behnezhad, Soheil; Rajaraman, Rajmohan; Wasim, Omer (January 2025, Society for Industrial and Applied Mathematics)

Full Text Available
One Tree to Rule Them All: Poly-Logarithmic Universal Steiner Tree

https://doi.org/10.1109/FOCS57990.2023.00012

Busch, Costas; Chen, Da Qi; Filtser, Arnold; Hathcock, Daniel; Hershkowitz, D Ellis; Rajaraman, Rajmohan (November 2023, IEEE)

Full Text Available

« Prev Next »

Search for: All records